Noise Robust Talker Localization Based on Weighted Csp Analysis with an Average Speech Spectrum for Microphone Array Steering

نویسندگان

  • Yuki Denda
  • Takanobu Nishiura
  • Yoichi Yamashita
چکیده

This paper proposes a noise robust talker localization method based on weighted CSP (Cross-power Spectrum Phase) analysis with an average speech spectrum as the pre-process of microphone array steering. The proposed method consists of two processes. First, CSP coefficients are weighted by analysis weight coefficients based on an average speech spectrum, which is trained with speech database, in advance. Next, the interference noises are reduced on spatial domain by CSP coefficient subtraction. As a result of evaluation experiments in a real room, we confirmed that the proposed method could provide better talker localization performance than the conventional methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A study of weighted CSP analysis with average speech spectrum for noise robust talker localization

This paper describes a new method of noise robust talker localization for the front-end processing of microphone array steering. Conventional talker localization methods cannot localize a target talker accurately in higher noisy environments. To deal with this problem, in this paper, we propose an weighted CSP (Cross-power Spectrum Phase) analysis with an average speech spectrum. The proposed m...

متن کامل

Statistical sound source identification in a real acoustic environment for robust speech recognition using a microphone array

It is very important for a hands-free speech interface to capture distant talking speech with high quality. A microphone array is an ideal candidate for this purpose. However, this approach requires localizing the target talker. Conventional talker localization methods in multiple sound source environments not only have difficulty localizing the multiple sound sources accurately, but also have ...

متن کامل

Distant-talking speech recognition based on a 3-D Viterbi search using a microphone array

This paper focuses on microphone arrays to realize distant-talking speech recognition in real environments. In distant-talking situations, users can speak at arbitrary positions while moving. Therefore, it is very important for high quality speech acquisition using microphone arrays to localize a talker accurately. However, it is very difficult to localize a moving talker in noisy and reverbera...

متن کامل

A Talker-Similarity Function Based on Fundamental Frequency for Use in Real-Time Talker Labeling of Microphone-Array Data

A method is presented for using mean fundamental frequency to measure talker similarity in real time from conversational speech in a noisy, reverberant room. This talker-similarity function is designed with the ultimate goal of real-time talker labeling in mind. A large-aperture array of wallmounted microphones is used, and talkers are allowed to enter the room without providing prior enrollmen...

متن کامل

Estimation of Talker's Head Orientation Based on Discrimination of the Shape of Cross-power Spectrum Phase Coefficients

This paper presents a talker’s head orientation estimation method using 2-channel microphones. In recent research, some approaches based on a network of microphone arrays have been proposed in order to estimate the talker’s head orientation. In those methods, the talker’s head orientation is estimated using the sound amplitude or peak value of CSP (Cross-power Spectrum Phase) coefficients obtai...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005